A Noise Robust Multilingual Reference Recogniser Based on Speechdat(II)
نویسندگان
چکیده
An important aspect of noise robustness of automatic speech recognisers (ASR) is the proper handling of non-speech acoustic events. The present paper describes further improvements of an already existing reference recogniser towards achieving such kind of robustness. The reference recogniser applied is the COST 249 SpeechDat reference recogniser, which is a fully automatic, language-independent training procedure for building a phonetic recogniser (http://www.telenor.no/fou/prosjekter/taletek/refrec). The reference recogniser relies on the HTK toolkit and a SpeechDat(II) compatible database, and is designed to serve as a reference system in multilingual speech recognition research. The paper describes version 0.96 of the reference recogniser which take into account labelled non-speech acoustic events during training and provides robustness against these during testing. Results are presented on small and medium vocabulary recognition for six languages.
منابع مشابه
The COST 249 SpeechDat Multilingual Reference Recogniser
The COST 249 SpeechDat reference recogniser is a fully automatic, language-independent training procedure for building a phonetic recogniser. It relies on the HTK toolkit and a SpeechDat(II) compatible database. The recogniser is designed to serve as a reference system in multilingual recognition research. This paper documents version 0.93 of the reference recogniser and presents results on sma...
متن کاملPhoneme-based recognition for the norwegian speechdat(II) database
This paper presents results from a number of exible vocabulary recognition experiments on the Norwegian SpeechDat(II) database. A common phoneme-based recogniser design procedure is tested on ve di erent tasks, and for ve di erent training sets. Results verify that reasonably accurate recognisers can be built with the database, using standard HMM techniques. They also quantify the importance of...
متن کاملPredstavitev učinkovitega postopka za robustno avtomatsko razpoznavanje govora
Extended abstract. Many automatic speech recognition systems, which operate in a laboratory environment, achieve high recognition rates. As speech recognition has moved from the laboratory to the field, however, recognition scores drop significantly. Robust speech recognition refers to the problem of designing an automatic speech recogniser that works well in a wide range of unexpected or adver...
متن کاملThe Development and Integration of the LDA-Toolkit Into COST249 SpeechDat(II) SIG Reference Recognizer
This paper presents the development of Linear Discriminant Analysis toolkit (LDA-Toolkit) and its integration into widely used COST249 SpeechDat(II) Task Force Reference Recognizer (RefRec). The crucial parts of the LDA, the determination of LDA classes, as well as the influence of the level of dimensionality reduction on automatic speech recognition performance, are discussed. Evaluation of pr...
متن کاملCrosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clustering
The paper describes our ongoing work on crosslingual speech recognition based on multilingual triphone hidden Markov models. Multilingual acoustic models were built using two different clustering procedures: agglomerative triphone clustering and tree-based triphone clustering. The agglomerative clustering procedure is based on measuring the similarity of triphones on a phoneme level where the m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000